Exploiting the Omission of Irrelevant Data

نویسندگان

  • Russell Greiner
  • Adam J. Grove
  • Alexander Kogan
چکیده

Most learning algorithms work most eeectively when their training data contain completely speciied labeled samples. In many diagnostic tasks, however, the data will include the values of only some of the attributes; we model this as a blocking process that hides the values of those attributes from the learner. While blockers that remove the values of critical attributes can handicap a learner, this paper instead focuses on blockers that remove only irrelevant attribute values, i.e., values that are not needed to classify an instance, given the values of the other unblocked attributes. We rst motivate and formalize this model of \superruous-value blocking," and then demonstrate that these omissions can be useful, by proving that certain classes that seem hard to learn in the general PAC model | viz., decision trees and DNF formulae | are trivial to learn in this setting. We also show that this model can be extended to deal with (1) theory revision (i.e., modifying an existing formula); (2) blockers that occasionally include superruous values or exclude required values; and (3) other corruptions of the training data. This is an extended version of the paper, \Dealing with (Intentionally) Omitted Data: Exploiting Relative Irrelevancies", which appears in working notes of the 1994 AAAI Fall Symposium on \Relevance", New Orleans, November 1994. We gratefully acknowledge receiving helpful comments from R.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Knowing what doesn't Matter: Exploiting the Omission of Irrelevant Data

Most learning algorithms work most e ectively when their training data contain completely speci ed labeled samples In many diagnostic tasks however the data will include the values of only some of the attributes we model this as a blocking process that hides the values of those attributes from the learner While blockers that remove the values of critical attributes can handicap a learner this p...

متن کامل

Depth Improvement for FTV Systems Based on the Gradual Omission of Outliers

Virtual view synthesis is an essential part of computer vision and 3D applications. A high-quality depth map is the main problem with virtual view synthesis. Because as compared to the color image the resolution of the corresponding depth image is low. In this paper, an efficient and confided method based on the gradual omission of outliers is proposed to compute reliable depth values. In the p...

متن کامل

Task-Irrelevant Novel Sounds Improve Attentional Performance in Children With and Without ADHD

Task-irrelevant salient stimuli involuntarily capture attention and can lead to distraction from an ongoing task, especially in children with ADHD. However, there has been tentative evidence that the presentation of novel sounds can have beneficial effects on cognitive performance. In the present study, we aimed to investigate the influence of novel sounds compared to no sound and a repeatedly ...

متن کامل

تحلیل فضایی ـ زمانی مدیریت مخاطرات آنتروپوژنیکی معادن در ایران

The appearance of Hazards in human life is affected by natural and human forces. So far, human beings were the most powerful stimulant to create these hazards and to intensify them. The negative role of human beings in environment is caused by factors like lack of knowledge, weak reaction, technology lack, aggressive ideologies and competition; in social system, however, human behavioral engine...

متن کامل

چالشهای تعریف جرم قاچاق در نظام حقوقی ایران

In Iranian legislation history, term of "smuggling" uses in different means. First, smuggling was equivalent with act or omission that lead to disregarding any taxes, duties or exclusive limited areas. In over time, this word focused on customs smuggling and illegal exporting and importing commodity. Lack of clear definition of smuggling causes dispersion of votes and viewpoints. On the one han...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996